Cross phone state clustering using lexical stress and context
نویسندگان
چکیده
This study deals with acoustic phonetic modelling in HMM based continuous speech recognition. Context dependent phone models were derived by a decision tree clustering algorithm. In particular, lexical stress was introduced as a clustering variable in addition to the phonetic context. The parameter sharing model was extended by tying HMM states across different target phones. For instance, one or more states of a tense vowel and the corresponding lax vowel were tied if they proved to be acoustically similar. The results indicate that the use of lexical stress information in acoustic modelling might be fruitful when large amounts of training data are available.
منابع مشابه
Lecture 5: Clustering and Adaptation
The state index should encode all context information that might influence the acoustics: the third state of the /t/ in “a tree” should be different from the third state of the /t/ in “a tip,” because they are acoustically different. Likewise, lexical stress, phrase position, glottalization, and dialect might matter. Unfortunately, we never have training examples sufficient to learn the likelih...
متن کاملImproved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
متن کاملTrying to improve phone and word recognition using finely tuned phone-like units
Phone-like units (PLUs) for automatic speech recognition are derived using a decision tree algorithm. In our approach we use information such as target phone label, immediate context, lexical stress level and function word affiliation in the decision tree analysis. The resulting PLUs are shown to improve phone and word recognition.
متن کاملUnsupervised Cross-Lingual Lexical Substitution
Cross-Lingual Lexical Substitution (CLLS) is the task that aims at providing for a target word in context, several alternative substitute words in another language. The proposed sets of translations may come from external resources or be extracted from textual data. In this paper, we apply for the first time an unsupervised cross-lingual WSD method to this task. The method exploits the results ...
متن کاملImproving letter-to-pronunciation accuracy with automatic morphologically-based stress prediction
Robust text-to-speech (TTS) systems require a letter-topronunciation module for generating the pronunciations of words missing from the system lexicon. These pronunciations must specify not only the phone sequence that corresponds to an input orthography, but also the location of lexical stress. However, letter-to-pronunciation modules that make use of a window of context letters around a targe...
متن کامل